Refining Genetically Inferred Relationships Using Treelet Covariance Smoothing.
نویسندگان
چکیده
Recent technological advances coupled with large sample sets have uncovered many factors underlying the genetic basis of traits and the predisposition to complex disease, but much is left to discover. A common thread to most genetic investigations is familial relationships. Close relatives can be identified from family records, and more distant relatives can be inferred from large panels of genetic markers. Unfortunately these empirical estimates can be noisy, especially regarding distant relatives. We propose a new method for denoising genetically-inferred relationship matrices by exploiting the underlying structure due to hierarchical groupings of correlated individuals. The approach, which we call Treelet Covariance Smoothing, employs a multiscale decomposition of covariance matrices to improve estimates of pairwise relationships. On both simulated and real data, we show that smoothing leads to better estimates of the relatedness amongst distantly related individuals. We illustrate our method with a large genome-wide association study and estimate the "heritability" of body mass index quite accurately. Traditionally heritability, defined as the fraction of the total trait variance attributable to additive genetic effects, is estimated from samples of closely related individuals using random effects models. We show that by using smoothed relationship matrices we can estimate heritability using population-based samples. Finally, while our methods have been developed for refining genetic relationship matrices and improving estimates of heritability, they have much broader potential application in statistics. Most notably, for error-in-variables random effects models and settings that require regularization of matrices with block or hierarchical structure.
منابع مشابه
Rejoinder of : Treelets — an Adaptive Multi - Scale Basis for Spare Unordered Data
1. A multiresolution transform guided by the second-order statistics of the data. The treelet transform is a multiresolution transform that allows one to represent the original data in an alternative form. Rather than describe the data in terms of the original set of covariates, we perform a series of rotations which gradually reveal the hierarchical grouping structure of the covariates. The id...
متن کاملRejoinder: Treelets
1. A multiresolution transform guided by the second-order statistics of the data. The treelet transform is a multiresolution transform that allows one to represent the original data in an alternative form. Rather than describe the data in terms of the original set of covariates we perform a series of rotations which gradually reveal the hierarchical grouping structure of the covariates. The ide...
متن کاملRejoinder Of: Treelets—an Adaptive Multi-scale Basis for Spare Unordered Data by Ann
1. A multiresolution transform guided by the second-order statistics of the data. The treelet transform is a multiresolution transform that allows one to represent the original data in an alternative form. Rather than describe the data in terms of the original set of covariates, we perform a series of rotations which gradually reveal the hierarchical grouping structure of the covariates. The id...
متن کاملFast covariance estimation for sparse functional data
Smoothing of noisy sample covariances is an important component in functional data analysis. We propose a novel covariance smoothing method based on penalized splines and associated software. The proposed method is a bivariate spline smoother that is designed for covariance smoothing and can be used for sparse functional or longitudinal data. We propose a fast algorithm for covariance smoothing...
متن کاملVariance decomposition of MRI-based covariance maps using genetically informative samples and structural equation modeling
The role of genetics in driving intracortical relationships is an important question that has rarely been studied in humans. In particular, there are no extant high-resolution imaging studies on genetic covariance. In this article, we describe a novel method that combines classical quantitative genetic methodologies for variance decomposition with recently developed semi-multivariate algorithms...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- The annals of applied statistics
دوره 7 2 شماره
صفحات -
تاریخ انتشار 2013